Feng Zheng

ConsistentRFT: Reducing Visual Hallucinations in Flow-based Reinforcement Fine-Tuning

Feb 03, 2026

UniPCB: A Unified Vision-Language Benchmark for Open-Ended PCB Quality Inspection

Jan 27, 2026

NaVIDA: Vision-Language Navigation with Inverse Dynamics Augmentation

Jan 26, 2026

ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes

Nov 18, 2025

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Sep 26, 2025

ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation

Sep 16, 2025

P³: Toward Versatile Embodied Agents

Aug 09, 2025

HCNQA: Enhancing 3D VQA with Hierarchical Concentration Narrowing Supervision

Jul 02, 2025

LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization

Jun 09, 2025

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

May 21, 2025